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(57) Abstract 

Link Ids are associated with file handles (104-105) in 
a directory structure in a computer operating system. The 
Link Ids allow a file handle to be mapped uniquely to a 
pathname for a file associated with the file handle. In one 
implementation lists are used to facilitate fast searching (208) 
of directory structures for a name associated with a Link Id. 
The list includes entry pairs where each entry pair is a Link 
Id and a directory number where a name associated with the 
Link Id may be found. 
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RECONSTRUCTING DIRECTORY PATHNAMES FROM FILE 
HANDLES IN A COMPUTER SYSTEM 

BACKGROUND OF THE INVENTION 
This invention relates generally to identifying 
files in computer operating systems and specifically to 
reconstructing a pathname from a file handle maintained by an 
operating system. 

Traditional computer operating systems use various 
ways to organize and manage files stored within a computer. 
One popular way to organize files is with a directory 
structure that allows files or directories to reside within 
other directories. When a directory resides within another 
directory, the residing directory is referred to as a "sub- 
directory." A file ultimately resides in a "parent" 
directory. The file's parent directory may, in turn, have a 
parent directory, and so on, thus creating a hierarchy of 
directories. This hierarchy of directories, each of which may 
contain files and additional directories, results in a 
directory tree structure such as the one shown in Fig. ia. in 
order to access a particular file, each of the parent 
directories in a chain from a starting, or "root," directory 
are named in a string, along with the file's name. The 
resulting string is called a "pathname." 

Fig. 1A shows a prior art directory structure 
including root directory, "directoryl, " which contains files 
such as "filel," «file2," "fii e3 " and »file4." Directoryl 
also includes directories directory2 and directory3. 
Directory2, in turn, contains files files and filee. 
Directory3 contains files fil e7 and files and also contains 
directories directory4 and directoryS. Directory4 contains 
files file9 and fileio. In practice a directory can contain 
any number of files or directories. 

The operating system allows a user of the computer 
system to create, access and manipulate files and directories 
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within a directory structure such as the directory structure 
shown in Fig. ia. For example, a user can create a file and 
specify that the file be placed in a particular directory. 
The user can also delete or move files. Similarly, for 
directories, the user can specify that a directory'be created 
or deleted. Some operating systems, such as operating systems 
based on the Unix operating system (i.e., Unix-like operating 
systems) , allow a user to create "links- between files so that 
a file can be accessed by more than one name. 

In Fig. ia, file? is referenced by the text string 
"/dire C toryl/directory3/file7". The directory, directory! is 
referred to as the "root" directory and is always given as' the 
starting directory in a pathname where the pathname specifies 
an absolute path to a file. An absolute pathname uniquely 
identifies any file within a given file system without 
requiring further information. Another form of pathname is a 
relative pathname, which identifies a file by using the 
relative path and a reference point such as a "current- 
directory or starting point. 

For example, in a relative path, assuming the 
current directory is directoryS, file? may be referenced 
merely by giving the name of the file as "file?". Another 
example of a relative path, assuming the current directory is 
directoryl, is to use the path "directory 3 / f ile?" to access 
file?. For ease of discussion, this application describes the 
invention in terms of absolute pathnames, it will be apparent 
that the concepts presented herein are equally applicable to 
relative pathnames. For general information on the Unix 
directory structure and pathnames consult, e.g., iso/iec 9945- 
1, IEEE 1003.1-1990. 

While pathnames are convenient ways for human users 
to specify directories and files, the operating system uses a 
more computationally convenient internal representation of a 
pathname called a "file handle". Typically, a file handle is 
a unique number or group of numbers and may also include other 
information that uniquely identifies an item such as a 
directory or file residing within the computer system. 
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Each file resides on a file system where a file 
system represents one or more disk drives. The file system 
containing the root directory is called the root file system 
Each file system has it's own file hierarchy headed by a root 
directory. The inode number for a root directory is always 
by convention, the number 2. Inode numbers are unique within 
a fUe system. File systems are grafted onto the root file 
system by a process called mounting. For example, if filB 5 
C/usr/x") is a directory then a second file system could be 
mounted on that directory as shown in Fig. 2. 

In Fig. ic, File System 2 has been mounted on File 
System i at the directory with inode number 5 ("directoryS") 
of Fig. ia. Note that, after the mount, the root directory of 
File System2 has effectively replaced File System i«s 
directory in the file hierarchy. DirectoryS will not be 
visible until File System 2 is unmounted, once the mount has 
taken place, access to files in File System 2 is made with 
pathnames beginning with «/usr/x«. For example, to access 
file 6 in File System 2, the pathname "/usr/x/q/s" would be 
used. 

The process of locating a file using a pathname is 
called pathname resolution. The product of pathname 
resolution is a file handle, a file handle is used by the 
operating system internally to refer to a file without having 
to resolve the file's name again. The file handle returned is 
typically a combination of the file system number (usually 
called a device number) and the inode number. Within this 
paper, file handles are represented using a pair of integers 
enclosed by parentheses (e.g., (2,6)). 

Renaming files across file systems is not allowed 
(for example, in Fig. id it would be illegal to rename 
"/usrl/y" to "/usr/x/p/f ») . 

A useful feature of Unix systems is that files can 
have more than one name, although most implementations allow 
directories to have only a single name. The link() system 
call is used to add a new name for an exisiting file. For 
example, if the call link ( - /usr/x/q/r" , "/usr/x/p/f) were 
made in an operating system including the directory structure 
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of Fig. ic, the result would be as in Fig. ID. Now either of 
the pathnames «/usr/x/q/r« or "/usr/x/p/f » can be used to 
access file (2 ,5). Note that this is not a situation where 
one name is the primary najne and tne Qther ig an alias __ ^ 
names can be used equally and the original name can be removed 
without affecting the new name. 

As is the case with rename, the link() system call 
generally does not allow links between file systems. 

A problem exists with the file organization in that 
given a file handle, there may be more than' one pathname that' 
resolves to the file represented by the file handle. This 
makes it difficult to regenerate a pathname based on a file 
handle. Por example, given file (2,5), because of the link 
call discussed above, there would be two possible pathnames to 
the fxle. W i tn only ^ liaited information provided by the 
mode number, it is impossible to tell which pathname was 
originally used to create the file handle, i.e., the inode 
number. 

Therefore, it is desirable to have a system where a 
file handle that is one of multiple file handles for the same 
file can be used to regenerate the pathname that was 
originally resolved to create the file handle, such a system 
would be useful, for example, to generate a pathname that was 
used to open a file in the case where the file has several 
fxle handles and pathnames. 

SUMMARY OF THE INVENTION 
To uniquely map a pathname to a given file handle, 
the system of the present invention uses additional 
information associated with the file system inode numbers, or 
file handle. Specifically, the system uses a number called a 
"link Id" and the inode number of the parent directory a 
Link Id is a number that uniquely identifies a link to a file. 

In one embodiment of the invention, a method for 
resolving pathnames to file handles in an operating system is 
used. The method includes assigning a link Id number to a new 
file by performing the substeps of creating and storing a link 
to the new file from a pre-existing parent file; creating and 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1A shows an example of a directory structure- 

in a Unix ul' " " eXaBPle ° f * direct °^ tree structure 
in a Unix- like operating system; 

Fig. 1C shows one file system mounted on another; 

to th. „• !* 1D Sh ° WS ^ reSUlt ° f executin 9 a link command 
to the directory structure of Fig ic; 

mi , Fi9 ' 2 ^ 3n illustration °f basic subsystems in a 

Tziirr* systea suitabie for use with the *~ 

Fig. 3 shows a flowchart for a routine that is 
called when a link command to link a new name to a target 
mode is executed; and 

Link Ids anf* ' " fl ° WCha " f ° r a ™tine that uses the 

Lank Ids and parent lists in a file system to associate a 
pathname to a file handle. 

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 

fv«i , Fig * 2 " ^ illustration of basic subsystems in a 
typical computer system suitable for use with the present 
invention, m Fig. 2 , subsystems are represented by blocks 
such as central processor 10, system memory n, display 
adapter 12, monitor 13, etc. The subsystems are 
interconnected via a system bus 14. Additional subsystems 

Pelhe Y^"' keyb ° ard ' f " ed dl6k Md shown. 
Peripherals and input/output (I/O) devices can be connected to 
the computer system by, for example serial port 15. For 
example, serial port 15 can be used to connect the computer 
system to a modem or mouse input device. The interconnection 
via system bus 14 allows central processor 10 to communicate 
with each subsystem and to control the execution of 
instructions from system memory 11 or fixed disk 16, and the 
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exchange of information between subsystems. other 
arrangements of subsystems and interconnections are possible. 

The present invention uses the concept of a "link 
Id." The link Id is essentially a count of the number of 
links to a file. A lastLinkld field is kept for each inode in 
the operating system's directory structure. The lastLinkld 
field contains the link Id number for each name that has been 
assigned, or "linked, » to the inode. 

When a file is initially created, the associated 
inode 's lastLinkld field is set to 1 and the initial link to 
the file is also l. As such successive link is added, 
lastLinkld is incremented and its value becomes the Id of the 
new link. Mote that link Ids need only be unique within inode 

the first link assigned to any file always has link Id l, 
the second has link Id 2, etc. 

As an example, suppose that the root file system is 
empty and that file «/a" is created. The new file will have 
inode number 3 (because the root directory always gets inode 
number 2) and "a" will have link Id 1. The contents of the 
root directory are shown in Table I. 

Name inode number link Id 



3 

Table I 



Now, if the call link (»/a», »/b") is made, the 
directory will be as shown in Table II. 

Nam© inode number link Id 

I 3 ~ I 

b 3 2 

Table II 



Next, assume that directory "d" (inode number 4) is 
added to the root directory. The root directory's contents 
would then be as shown Table III. 
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Name 



inode number link Id 



b 3 r 

5 3 2 

° 4 i 

Table III 



If at this point, the call link ("/&», "/d/f") is 
made, the contents of the root directory are unchanged and the 
contents of directory4 are as shown in Table IV. 



Name inode number link Id 

f 3- g 

Table IV 

At this point, the lastLiakld field in inode 3 has 
value 3 because a total of three links to the file have been 
added since the file was created. 

Next, flowcharts are presented to describe steps in 
routines of the present invention for assigning Link ids and 
parent list entries to an inode after a link command is 
executed and for using Link Ids and parent list entries to 
resolve, or map, a pathname to a given file handle. In 
general, the flowcharts in this specification illustrate one 
or more software routines executing in a computer system such 
as computer system 1 of Pig. 2 . The routines may be 
implemented, by any means as is known in the art. For example, 
any number of computer programming languages, such as »c«, 
Pascal, FORTRAN, assembly language, etc., may be used. 
Further, various programming approaches such as procedural, 
object oriented or artificial intelligence techniques may be 
employed. 

The steps of the flowcharts may be implemented by 
one or more software routines, processes, subroutines, 
modules, etc. In some cases steps may be implemented by 
making use of operating system calls. It will be apparent 
that each flowchart is illustrative of merely the broad 
logical flow of the method of the present invention and that 
steps may be added to, or taken away from, the flowcharts 
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without departing from the scope of the invention. Further 
the order of execution of steps in the flowcharts may be 
changed without departing fro* the scope of the invention. 
Additional considerations in implementing the method described 
by the flowchart in software may dictate changes in the 
selection and order of steps. Some considerations are event 
handling by interrupt driven, polled, or other schemes, a 
multiprocessing or multitasking environment could allow steps 
to be executed "concurrently." For ease of discussion the 
implementation of each flowchart is referred to as if it is 
implemented in a single "routine". 

in Fig. 3, flowchart 100 is entered at step 102 when 
a link command to link a new name to a target inode, as 
described above, is executed. Before entering flowchart 100 
it is assumed that the target inode has a lastLinkld field 
associated with it. The lastLinkld field is initially set to 
1 for the original link to the target inode. The target inode 
has a Link Id of i. The name used to create the target inode 
is associated with the Link id and the target inode number as 
shown, for example, in Table I, above, for the link named "a » 
Each of the Tables represent information for inodes associated 
with the parent directory of the inodes residing in the parent 
directory. 

At step 104 the lastLinkld value for the target 
mode is incremented to generate a Link Id for the new link 
At step 105 the, value of lastLinkld is assigned to the linked 
name m the parent directory table, in the present example, 
the target inode only has one link, named "a," and the 
lastLinkld value was left at 1. Thus , the incremented 
lastLinkld value is 2. A new entry is made in the parent 
directory table for the parent directory of the target inode, 
namely the root directory. The new entry includes the linked 
name, "b,« along with the target inode number, 3, and the Link 
W 2. The routine of flowchart 100 exits at step 106. 

Thus, the flowchart of Fig. 3 illustrates a method 
for assigning Link Ids When the link command is executed in an 
operating system. 



WO 96/23268 

POYUS96/00783 

10 

Fig. 4 shows flowchart 200 for a routine that uses 
Link Ids in a file system to associate a pathname to a file 
handle. Flowchart 200 is entered at step 202 where it is 
assumed that a file handle including a child inode number a 
parent inode number, a file system number and a Link Id is 
passed. 

At step 204, the parent inode number and file system 
number are used to locate the parent directory. At step 206 
a test is made as to whether the directory was located or not 
If the directory was not located, the file handle is assumed 
to be "stale" and the routine is exited with an error via step 
213. 

At step 208, the directory table is searched looking 
for a match on the child inode number and Link id. At step 
210, a test i made as to whether a match was found or not. If 
a match was not found, the file handle is assumed to be stale 
and the routine is exited with an error via step 213. 

At step 212, the slash character (»•/») and the file 
name in the directory entry matching the child inode and Link 
Id are appended to the name of the parent directory to form a 
complete pathname. The name of the parent directory is 
obtained using whatever function the operating system uses to 
obtain the directory name. In the preferred embodiment, a 
procedure that follows the steps used by the Unix operating 
system command "getcwd" is used to obtain the name of the 
parent directory. 

Finally, the routine is exited at step 214. 

Thus, Pig. 4 shows a routine for mapping a file 
handle to a pathname where the file handle is for a file that 
can have multiple pathnames. 

In the foregoing specification, the invention has 
been described with reference to a specific exemplary 
embodiment thereof, it will, however, be evident that various 
modifications and changes may be made without departing from 
the broader spirit and scope of the invention as set forth in 
the appended claims. The specification and drawings are, 
accordingly, to be regarded in an illustrative rather than a 
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1. A method for resolving pathnames to file 
handles in an operating system executing on a computer system 
wherein the computer system includes a processor and a memory' 
wherein the operating system allows the creation of pathnames' 
by accepting a sequence of ordered links to define a pathname 
wherein each link includes a link name, wherein each link is ' 
associated between a parent file and a target file, wherein 
each file has a node number, wherein a given sequence of 
ordered links defines a pathname to the target file associated 
with the final link in the sequence, wherein the file handle 
for a given file includes the node number of the given file, 
the method comprising the steps of: 

assigning a link ID number to a new file by 
performing the following substeps 
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" creating and storing a link to the new file from a 

16 pre-existing parent file; 
17 

18 
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creating and storing a node number for the new file; 
storing a unique link ID number for the new file 
wherein the link ID number is unique among all links to the 
target file; and 

subsequent to the above steps, recreating the 
pathname to the new file by using the node number and link ID 
23 of the new file. 



The nethod of claim 1, wherein the operating 
system is an operating system having a UNIX file and directory 
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